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Abstract 

In this paper we study gossip based information spreading with bounded message sizes. We 
use algebraic gossip to disseminate k distinct messages to aU n nodes in a network. For arbitrary 
networks we provide a new upper bound for uniform algebraic gossip of 0{{k + logn + D)A) 
rounds with high probability, where D and A arc the diameter and the maximum degree in the 
network, respectively. For many topologies and selections of k this bound improves previous 
results, in particular, for graphs with a constant maximum degree it implies that uniform gossip 
is order optimal and the stopping time is 0{k + D). 

To eliminate the factor of A from the upper bound we propose a non-uniform gossip protocol, 
TAG, which is based on algebraic gossip and an arbitrary spanning tree protocol S. The stopping 
time of TAG is 0{k + log n + d{S) + t{S)), where t{S) is the stopping time of the spanning tree 
protocol, and d{S) is the diameter of the spanning tree. We provide two general cases in which 
this bound leads to an order optimal protocol. The first is for k — ri(n), where, using a simple 
gossip broadcast protocol that creates a spanning tree in at most linear time, we show that 
TAG finishes after <d{n) rounds for any graph. The second uses a sophisticated, recent gossip 
protocol to build a fast spanning tree on graphs with large weak conductance. In turn, this 
leads to the optimally of TAG on these graphs for k — r2(polylog(n)). The technique used in 
our proofs relies on queuing theory, which is an interesting approach that can be useful in future 
gossip analysis. 
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1 Introduction 



One of the most basic information spreading applications is that of disseminating information stored 
at a subset of source nodes to a set of sink nodes. Here we consider the k- dissemination case: k 
initial messages {k < n) located at some nodes (a node can hold more than one initial message) 
need to reach all n nodes. The all-to-all communication - each of n nodes has an initial value that 
is needed to be disseminated to all nodes - is a special case of A;-dissemination. The goal is to 
perform this task in the lowest possible number of time steps when messages have limited size (i.e., 
a node may not be able to send all its data in one message). 

Gossiping, or rumor-spreading, is a simple stochastic process for dissemination of information 
across a network. In a synchronous round of gossip, each node chooses a single neighbor as the 
communication partner and takes an action. In an asynchronous time model a single node wake-ups 
and chooses the communication partner and n consecutive steps are considered as one round. The 
gossip communication model defines how to select this neighbor, e.g., uniform gossip is when the 
communication partner is selected uniformly at random from the set of all neighbors. We then 
consider three possible actions: either the node pushes information to the partner (PUSH), pulls 
information from the partner (PULL), or does both (EXCHANGE), but here we mostly present results 
about EXCHANGE. 

A gossip protocol uses a gossip communication model in conjunction with the choice of the 
particular content that is exchanged. Due to their distributed nature, gossip protocols have gained 
popularity in recent years and have found applications both in communication networks (for exam- 
ple, updating database replicated at many sites [9l[l6], computation of aggregate information [17] 
and multicast via network coding ^ , to name a few) as well as in social networks [181 [6] . 

In the current work we analyze algebraic gossip which is a type of network coding known as 
random linear coding (RLNC) [201 [T9] that uses gossip algorithms for all-to-all communication and 
/c-dissemination. In algebraic gossip the content of messages is the random linear combination of all 
messages stored at a sender. Once a node has received enough independent messages (independent 
linear equations) it can solve the system of linear equations and discover all the initial values of 
all other nodes. It has been proved [H] that network coding can improve the throughput of the 
network by better sharing of the network resources. Note, however, that in gossip protocols, nodes 
select a single partner, so for /c-dissemination to succeed each node needs to receive at least k 
messages (of bounded size), hence at least a total of kn messages need to be sent and received. 
This immediately leads to a trivial lower bound of ^{k) rounds for fc-dissemination. 

We study uniform and non-uniform algebraic gossip both in the synchronous and the asyn- 
chronous time models on arbitrary graph topologies. The stopping time obviously depends on the 
protocol, the gossip communication model, the graph topology, but also on the time model, as sown 
in other cases [11]. We now give an overview of our results followed by a discussion of previous 
work. 

1.1 Overview of Our Results 

Our first set of results is about the stopping time of uniform algebraic gossip. In [2] we have shown 
a tight bound of B(n) for all-to-all communication for graphs with constant maximum degree. To 
prove this, we used a reduction of gossip to a network of queues and analyzed the waiting times 
in the queues. Bounding the general fc-dissemination case is significantly harder, despite some 
similarity in the tools used. Unless explicitly stated, all our results are for gossip using EXCHANGE 
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and are with high probabihtjll]. 

We provide a novel upper bound for uniform algebraic gossip of 0{{k + logn + D)A) where D 
is the diameter and A is the maximum degree in the graph. For graphs with constant maximum 
degree this leads to a bound of 0{k + D). For the synchronous case we have a matching lower 
bound of Q{k + D) which makes uniform algebraic gossip an order optimal gossip protocol for 
these graphs. We conjecture that the optimality holds for the asynchronous time model as well, 
but only show it when k = Vt[D). 

However, there are topologies for which uniform algebraic gossip performs badly, e.g., in the 
barbell graph (two cliques connected with a single edge) it takes VL{'n?) rounds to perform all-to-all 
communication [2j . This is usually the result of bottlenecks that exist in the graph and lead to low 
conductance. For such "bad" topologies we propose here a modification of the uniform algebraic 
gossip called Tree based Algebraic Gossip (TAG). The basic idea of the protocol is that it operates 
in two phases: first, using a gossip protocol S it generates a spanning tree in which each node in the 
tree has a single parent. In the next phase, algebraic gossip is performed on the tree where each node 
does EXCHANGE with its parent. Let t{S) and d{S) be the stopping time of S and the diameter of 
the tree generated by 5, respectively. For any spanning tree gossip protocol S we prove for TAG an 
upper bound of: 0{k + \ogn + d{S) + t[S)) for the synchronous and the asynchronous time models. 
As a special case of a spanning tree protocol, one can use a gossip broadcast (or 1-dissemination) 
protocol B - a protocol in which a single message originated at some node should be disseminated 
to all nodes. Interestingly, using a gossip broadcast for the spanning tree construction in TAG, 
eliminates the dependence on the diameter of the spanning tree in the synchronous time model, 
i.e., if we use B as S, we obtain the bound of 0{k + logn -|- t{B)) rounds. For a general spanning 
tree protocol S, it follows directly that if A; = i}{m.ax{logn,d{S),t{S))), TAG is an order optimal 
with a stopping time of Q{k). We provide two examples of this scenario: the first example leads 
to the most significant result of the paper. Using a simple round-robin-based broadcast we show 
that TAG is an order optimal gossip protocol for k- dissemination in any topology when k = il(n). 
This imply, somewhat surprisingly, that for any graph, if k = i}(n), TAG finishes in 0(n) rounds. 
In the barbell graph mentioned above, TAG leads to a speedup ratio of n compare to the uniform 
algebraic gossip. The second example makes use of a recent non- uniform information dissemination 
protocol from [5| that works well on graphs G with large weak conductance denoted by ^dG) for 
a parameter c (see Section [6]). We provide sufficient conditions on k, c and $c(G) that make TAG 
order optimal when using the protocol of [5j as a spanning tree protocol. Table [T] summarizes our 
main results of the paper and next, we discuss previous results. 

1.2 Related Work 

Uniform algebraic gossip was first proposed by Deb et al. in [8]. The authors studied uniform 
algebraic gossip using PULL and PUSH on the complete graph and showed a tight bound of Q{h), 
for the case oi k = oj{log'^{n)) messages. Boyd et al. [3l S] studied the stopping time of a gossip 
protocol for the averaging problem using the EXCHANGE algorithm. They gave a bound for symmetric 
networks that is based on the second largest eigenvalue of the transition matrix or, equally, the 
mixing time of a random walk on the network, and showed that the mixing time captures the 
behavior of the protocol. Mosk-Aoyama and Shah [23] used a similar approach to [S] Hj to first 
analyze algebraic gossip on arbitrary networks. They consider symmetric stochastic matrices that 

^An event occurs with high probabiHty {w.h.p. ) if its probability is of at least 1 — 0(7^)- 
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Protocol 


Graph 


Synchronous 


Asynchronous 


Uniform AG 


any graph 


0{{k + logn + D)A) 


constant max degree 


0(k + D) 


0{k + D) (*) 


TAG 


any graph 


0{k + log n + d{S) + t{S)) 


0{k + log n + t{B)) 


0{k + log n + d{B) + t{B)) 


k = ^}{n), any graph 


0(n) 


c = 0(logP(n)) 
k = fi(log2P+3 (^)) 


0(k) 


0{k + d{IS)) (**) 



Table 1: Overview of the main results of the paper. Bold text and indicate order optimal 
result. (*) we prove an upper bound but conjecture it is optimal. (**) we prove the upper bound 
but conjecture is should be Q{k). 5 is a spanning tree protocol, ;B is a broadcast protocol, and IS 
is an information dissemination gossip protocol from [5]. 

(may) lead to a non-uniform gossip and gave an upper bound for the PULL algorithm that is based 
on a measure of conductance of the network. As the authors mentioned, the offered bound is not 
tight, which indicates that their conductance-based measure does not capture the full behavior of 
the protocol. 

In [2], we used queuing theory as a novel approach for analyzing algebraic gossip. We then gave 
an upper bound of O(nA) rounds for any graph for the case of all-to-all communication, where A 
is the maximum degree in the graph. In addition, a lower bound of Q.{n?') was obtained for the 
barbell graph - the worst case graph for algebraic gossip. The bounds (upper and lower) in [2] were 
tight in the sense that they matched each other for the worst case scenario. The parameter A is 
simple and convenient to use, but, it does not fully capture the behavior of algebraic gossip. While 
it gives optimal (0(n)) result for any constant-degree graphs (e.g., line, grid), it fails to reflect the 
stopping time of algebraic gossip on the complete graph, for example, by giving the O(n^) bound 
instead of 0{n). 

A recent (yet, unpublished) work of Haeupler [13] is the most related to our work. Haeupler's 
paper makes a significant progress in analyzing the stopping time of algebraic gossip. While all 
previous works on algebraic gossip used the notion of helpful message/node to look at the rank 
evaluation of the matrices each node maintains (this approach was initially proposed by [8] ) , Haeu- 
pler used a completely different approach. Instead of looking on the growth of the node's subspace 
(spanned by the linear equations it has), he proposed to look at the orthogonal complement of the 
subspace and then analyze the process of its disappearing. This elegant and powerful approach led 
to very impressive results. First, a tight bound of Q{n/^) was proposed for all-to-all communica- 
tion, where 7 is a min-cut measure of the a related graph. This bound perfectly captures algebraic 
gossip behavior for any network topology. For the case of fc-dissemination, the author gives a con- 
jecture that the upper bound is of the form of 0{k + T) where T is the time to disseminate a single 
message to all the nodes. But formally, the bound that is proved is 0{k/^ + log^n/A) where A 
is a conductance-based measure of the graph. The work in [13] implicitly considered the uniform 
algebraic gossip, but could be extend to non-uniform cases. It is therefore hard to compare TAG to 
the results of [13] , nevertheless, our bounds for the uniform algebraic gossip are better for certain 
families of graphs. Table [2] presents few such examples. 
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Graph 


0(/c/7 + log2n/A)/n [IS] 


0{{k + \ogn + D)A) [here] 


Improvement factor 


Line 


0{k + nlog^ n) 


Oik + n) 


log^ n 


Grid 


0{k + -^/n log^ n) 


Oik + y/E) 


log^ n for = 0{^/n) 


Binary Tree 


0{k + nlog^ n) 


0{k + log n) 





Table 2: Comparison of our results with [13] 



To give a quick summary of our results and previous work, the two main contributions of the 
paper are i) we prove that for graphs with constant maximum degree uniform algebraic gossip is 
order optimal for A;-dissemination in the synchronous time model and ii) we offer a new non-uniform 
algebraic gossip protocol, TAG, that is order optimal for large selections of graphs and k. The rest 
of the paper is organized as follows: in Section [2] we give definitions. Section [3] proves results for 
uniform algebraic gossip and Section [3| presents the TAG protocol and its general bound. Sections 
[5|and[6l then, discuss cases where TAG is optimal. 

2 Preliminaries 

We model the communication network by a connected undirected graph G„ = Gn{y,E), where V 
is the set of vertices and E is the set of edges. Number of vertices in the graph is \V\ = n. Let 
N{v) C y be a set of neighbors of node v and dy = \N{v)\ its degree, let A = maxyd^ be the 
maximum degree of G„, and let D be the diameter of the graph. 

We consider two time models: asynchronous and synchronous. In the asynchronous time model 
at every timeslot, one node selected independently and uniformly at random, takes an action 
and a single pair of nodes communicate^. We consider n consecutive timeslots as one round. In 
the synchronous time model at every round, every node takes an action and selects a single 
communication partner. It is assumed that the information received in the current round will be 
available to a node for sending only at the beginning of the next round. A Gossip communication 
model (sometimes called gossip algorithm) defines the way information is spread in the network. In 
the gossip communication model, a node that wakes up (according to the time model) can initiate 
communication only with a single neighboiH (i.e., communication partner). The model describes 
how the communication partner is chosen and in which direction (to - PUSH, from -PULL, or both 
- EXCHANGE) the message is sent. In this work we use the following communication models: 

Definition 1 (Uniform Gossip). Uniform gossip is a gossip in which a communication partner is 
chosen randomly and uniformly among all the neighbors. 

Definition 2 (Round-Robin (TZTZ) Gossip). In round-robin gossip, the communication partner is 
chosen according to a fixed, cyclic list, of the nodes ' neighbors. This list dictates the order in which 
neighbors are being contacted. If the initial partner is chosen at random, this gossip communication 
model is known as the quasirandom rumor spreading model IH [70| /. 

"^Alternatively, this model can be seen as each node having a clock which ticks at the times of a rate 1 Poisson 
process and there is a total n clock ticks per round [3] . 

^Note that this implies that in the synchronous model a node can communicate with more than a single neighbor, 
if other nodes initiate communication with it. 
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Gossip Protocols define the task and the message content. In turn, a gossip protocol can use 
any of the gossip communication models defined above (and others). We will use two types of 
gossip protocols here. The first is STP Gossip - protocols whose task is to create a spanning tree 
of the graph. The goal of a Gossip STP protocol S is that every node, except a node which is the 
root, will have a single neighbor called the parent. Note that one simple way to generate a spanning 
tree is by using a 1-dissemination protocol, namely a broadcast protocol. 

The second protocol, is a fc-dissemination protocol called Algebraic Gossip. In algebraic gos- 
sip, every message sent by a node is sent according to the random linear coding (RLNC) technique 
which is described next. As mentioned, algebraic gossip can use any of the communication models 
presented above. 

Random Linear Network Coding (RLNC) The random linear network coding approach is 
used in algebraic gossip for building outgoing messages to achieve fast information dissemination. 
Let ¥q be a field of size q. There are k < n initial messages (xi, x^) that are represented as 
vectors in F^. We can represent every message as an integer value bounded by M, and therefore, 
r = l^logg(M)]. All transmitted messages have a fixed length and represent linear equations over 
¥q. The variables (unknowns) of these equations are the initial values Xj S F^, 1 < i < k and a 
message contains the coefficients of the variables and the result of the equation; therefore the length 
of each message is: rlog2(? + klog2q bits (and it is usually assumed that r S> n [8]). A message 
is built as a random linear combination of all messages stored by the node and the coefficients are 
drawn uniformly at random from Fg. A received message will be appended to the node's stored 
messages only if it is independent of all linear equations (messages) that are already stored by the 
node and otherwise it is ignored. Nodes store messages (linear equations) in a matrix form and 
once the dimension (or rank) of the matrix becomes k, a node can solve the linear system and 
discover all the k messages. 

The following definition is necessary for understanding the concept of helpfulness in the analysis 
of algebraic gossip. 

Definition 3 (Helpful node and helpful message). We say that a node x is a helpful node to 

a node y if and only if a random linear combination constructed by x can be linearly independent 
with all equations (messages) stored in y. We call a message a helpful message if it increases the 
dimension (or rank) of the node (i.e., the rank of the matrix in which the node stores the messages). 

3 /c-dissemination with Uniform Algebraic Gossip 

The main result of this section is that uniform algebraic gossip is order optimal A;-dissemination for 
graphs with constant maximum degree and for any selection of A;. It is formally stated in Theorem 
[3] and is an almost direct result of the following general bound for uniform algebraic gossip: 

Theorem 1. For any connected graph Gn, the stopping time of the uniform algebraic gossip protocol 
with k messages is 0((A;+log n+L))A) rounds for synchronous and asynchronous time models w.h.p. 

The idea of the proof relies on the queuing networks technique we presented in [2j. The major 
steps of the proof are: 

• Perform a Breath First Search (BFS) on Gn starting at an arbitrary node v. The search 
results in a directed shortest path spanning tree T„ rooted at v. The maximum depth /max 
of the tree T„ rooted at v is at most D. 
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• Reduce the problem of algebraic gossip on a tree to a simple system of queues Q^^^^ 
rooted at v, where at each node we assume an infinite queue with a single server. Every 
initial message becomes a customer in the queuing system. The root v finishes once all the 
customers arrive at it. 

• Show that the stopping time of the tree topology queuing system - Q^^^^-, is 0{{k + logn + 
^max)'T'A) timeslots w.h.p. So, we obtain the stopping time for the node v. 

• Use union bound to obtain the result for all the nodes in Gn- 

Just before we start the formal proof of Theorem [H we present an interesting theorem related 
to queuing theory. The theorem gives the stopping time of the feedforward queuing system [7j 
arranged in a tree topology. Consider the following scenario: n identical M/M/1 queues arranged 
in a tree topology. There are no external arrivals, and there are k customers arbitrarily distributed 
in the system. In the feedforward network, a customer can not enter the same queue more than 
once, thus, customers eventually leave the system via the queue at the root of the tree. We ask the 
following question: how much time will it take for the last customer to leave the system? 

Theorem 2. Let Q^^^ be a network of n nodes arranged in a tree topology, rooted at the node v. 
The depth of the tree is /max- Each node has an infinite queue, and a single exponential server with 
parameter fx. The total amount of customers in the system is k and they are initially distributed 
arbitrarily in the network. The time by which all the customers leave the network via the root node 
V is t{Q^^'^^) = 0{{k + ?max + logn)//i) timeslots with probability of at least 1 — 

The main idea of the proof is to show that the stopping time of the network Q^ '^^ (i-e., the time 
by which all the customers leave the network) is stochasticallj0 smaller or equal to the stopping 
time of the systems of /max queues arranged in a line topology - Q/*"*^- Then, we make the system 
'^/^a'x stochastically slower by moving all the customers out of the system and make them enter 
back via the farthest queue with the rate A = fi/2. Finally, we use Jackson's Theorem for open 
networks to find the stopping time of the system. See Fig. [1] for the illustration. The full proof of 
the above theorem can be found in the appendix. We can now prove Theorem [1] 

Proof of Theorem [21 We start the analysis of the uniform algebraic gossip with k messages and the 
asynchronous time model. First, we perform a Breath First Search (BFS) on G„ starting at an 
arbitrary node v. The search results in a directed shortest path spanning tree T„ rooted at v. The 
depth of Tn is /max, and since T„ is the shortest path tree, /max < D, where D is the diameter of 
the graph. On the tree T„, consider a message fiow towards the root v from all other nodes. Once 
k helpful messages arrive at v, it will reach rank k and finish the algebraic gossip protocol. We 
ignore messages that are not sent in the direction of v. Ignoring part of messages can only increase 
the stopping time of the algebraic gossip protocol. 

We define a queuing system Q^*^^ by assuming an infinite queue with a single server at each 
node. The root of Q^^^ is the node v. Customers of our queuing network are helpful messages, i.e., 
messages that increase the rank of a node they arrive at. This means that every customer arriving 
at some node increases its rank by 1. When a customer leaves a node, it arrives at the parent node. 
The queue length of a node represents a measure of helpfulness of the node to its parent, i.e., the 
number of helpful messages it can generate for it. 

*For completeness, stochastic dominance is formally defined in appendix. 
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Figure 1: Reduction of AG to a system of queues, (a) - Initial graph G„. (b) - Spanning tree r„. (c) 
- System of queues Qn'^'^- (d) - System of queues Q;™'^- Stopping time of Q;™"^ is larger than of Q^'^'^- 
(e)-Taking all customers out of the system and use Jackson theorem for open networks. 



The service procedure at a node is a transmission of a helpful message towards the node v (from 
a node to its parent). Lemma 2.1 in [8] gives a lower bound for the probability of a message sent 
by a helpful node to be a helpful message, which is: 1 ~ |- In the uniform gossip communication 
model, the communication partner of a node is chosen randomly among all the node's neighbors 
in the original graph Gn- The degree of each node in Gn is at most A. Thus, in the asynchronous 
time model, in a given timeslot, a helpful message will be sent over the edge in a specific direction 
with probability of at least (1 — |)/nA, where ^ is the probability that a given node wakes up in 
a given timeslot, is the minimal probability that a specific partner (the parent of the node) will 
be chosen, and 1 — | is the minimal probability that the message will be helpful. Thus, we can 
consider that the service time in our queuing system is geometrically distributed with parameter 
p > (1 — |)/nA, and since g > 2, we can assume the worst case: p = i^;^- 

Lemma 2 in [2] shows that we can model the service time of each server as an exponential 
random variable with parameter fj, = p, since in this case, exponential servers are stochastically 
slower than geometric. Such an assumption can only increase the stopping time. 

Theorem [2] with fi = p gives us an upper bound for the stopping time of the node v, ty = 
0((fe + /max + logn)2nA) timeslots with probability of at least 1 — Since the depth of every BFS 
tree is bounded by the diameter D, using a union bound we obtain the upper bound (in timeslots) 
for all the nodes in Gn'- 

Pr I Pi = 0{{k + logn + D)2nA) j > 1 - -. (1) 

Thus we obtain the upper bound for uniform algebraic gossip: 0{{k + logn + -D)A) rounds. 
Next, we show that this bound holds also for the synchronous time model. The proof for the 
synchronous time model is almost the same as in the asynchronous case, except for the following 
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change. Instead of dividing time into timeslots, we measure it by rounds (1 round = n timeslots). In 
a given round, a helpful message will be sent over the edge in a specific direction with probability 
p > (1 — |)/A, where the ^ is the minimal probability that a specific partner (the parent of the 
node) will be chosen, and 1 — | is the minimal probability that the message will be helpful. Since 

q > 2, we can assume the worst case: P = 2k'' "^^^ difference from the asynchronous model is the 
factor of n in p, since in the synchronous model, every node wakes up exactly once in a each round. 
Moreover, in the synchronous case (and in the EXCHANGE gossip variation) there is a possibility 
to receive 2 messages from the same node in one round (in the asynchronous time model it was 
impossible to receive 2 messages from the same node in one timeslot). We assume that if a node 
receives 2 messages from the same node at the same round, it will discard the second one. Such an 
assumption can only increase the stopping time of the protocol, and will make our analysis simpler. 
From that point on, the analysis is exactly the same as in the asynchronous case since Theorem [2] 
does not depend on the time model. □ 

3.1 Optimality for Constant Maximum Degree Graphs and Synchronous Time 

Following Theorem [1] we can state the main results of the section: 

Theorem 3. For any connected graph Gn with constant maximum degree, the stopping time of 
the uniform algebraic gossip protocol with k messages is Q{k + D) in the synchronous time and 
0{k + D) in the asynchronous time w.h.p. 

Proof. To show the upper bound the following simple claim is proved in the appendix: 

Claim 1. For any connected graph Gn with a constant maximum degree (A = 0(1)), the diameter 
of Gn is ri(logn). 

Now, using Claim [1] and fact the the maximum degree is constant the upper bound follows. For 
the lower bound note that in order to disseminate k messages to n nodes, at least kn transmissions 
should occur in the network. In synchronous time model, kn transmissions require at least k/2 
rounds, since every round at most 2n messages are sent (2 transmissions per communication pair). 
In the asynchronous time model, kn transmissions require at least kn/2 timeslots, since at each 
timeslot at most 2 nodes transmit (due to EXCHANGE). Thus, in both time models, il(/c) rounds are 
required. Moreover, in the synchronous time model, dissemination of a single message will take at 
least D/2 rounds, since in this model, a message can travel at most one hop in a single round. So, 
for the synchronous time model, the bound Q{k + D) is tight and optimal. □ 

4 TAG: /c-dissemination with Tree-based Algebraic Gossip 

We now describe the protocol TAG (Tree based Algebraic Gossip), which is a fc-dissemination 
gossip protocol that exploits algebraic gossip in conjunction with a spanning tree gossip protocol 

5 (see Sec. [2]). Given a connected network of n nodes and k messages xi, that are initially 
located at some nodes, the goal of the protocol TAG is to disseminate all the k messages to all the 
n nodes. The protocol consists of two phases. Both phases are performed simultaneously in the 
following way: if a node wakes up when the total number of its wakeups until now is even, it acts 
according to Phase 1 of the protocol. If the node wakes up when the total number of its wakeups 
until now is odd, it acts according to Phase 2 of the protocol. 



8 



Protocol TAG Psciido code for node v. Example for asynchronous time model. 
Require: N{v), k, gossip spanning tree protocol S 
Initialize: parent = null 

On odd wakeup: // Phase 1: EXCHANGE gossip spanning tree protocol S 

1; choose parter u € N{v) and exchange messages with it according to S 
2: according to S decide if parent = u 

On even wakeup: // Phase 2: EXCHANGE algebraic gossip 

3: if obtained parent during the protocol S then 

4: exchange messages with parent according to algebraic gossip (RLNC) 

On ccMitact, I'roni oilier node w G 
5: if w performs Phase 1 then 
6: exchange messages with w according to S 
7: according to S decide if parent = w 
8: else(w performs Phase 2) 

9: exchange messages with w according (o algeliraie gossip (E.LXC) 



• In Phase 1, a node performs a spanning tree gossip protocol S. Once a node becomes a part 
of the spanning tree, it obtains a parent. 

• In Phase 2, a node is idle until it obtains a parent in Phase 1. From now on, in Phase 2, the 
node will perform an EXCHANGE algebraic gossip protocol with a fixed communication partner 
- its parent. 

The following theorem gives an upper bound on the stopping time of the protocol TAG. 

Theorem 4. Let t(S) be the stopping time of the gossip spanning tree protocol S performed at 
Phase 1, and let d{S) he the diameter of the spanning tree created by S. For any connected graph 
Gn, the stopping time of the k-dissemination protocol TAG, is: 

t{TAG) = 0{k + logn + d{S) + t{S)) rounds (2) 

for synchronous and asynchronous time models, and w.h.p. 

Proof. In order to prove this theorem, we will find the time needed to finish TAG, after Phase 1 is 
completed. Once Phase 1 is completed, every node knows its parent and thus, in Phase 2, we have 
the algebraic gossip EXCHANGE protocol on the spanning tree r„, where communication partners of 
the nodes are their parents. The following lemma gives an upper bound on the stopping time of 
such a setting. 

Lemma 1. Let Tn be a tree with n nodes, rooted at the node r, with depth /max- There are k initial 
messages located at some nodes in the tree. Consider algebraic gossip EXCHANGE protocol with the 
following communication model: the communication partner of a node is fixed to he its parent in 
Tn during the whole protocol. Then, the time needed for all the nodes to learn all the k messages 
is 0{k + logn + ^max)) rounds for the synchronous and asynchronous time models, with probability 
of at least 1 — § . 
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The proof of Lemma [J is very similar to the proof of Theorem [H and rehes on reducing the prob- 
lem of algebraic gossip to a simple system of queues. The service time is geometrically distributed 
with a worst-case parameter P = The A is eliminated from p since each node chooses now a 
single communication partner. Then, using Theorem [2] we obtain the stopping time of algebraic 
gossip with on the tree r„. Detailed proof of Lemma [1] can be found in appendix. 

Since for every choice of the tree root, the depth of the tree (which was created using 
protocol t{S)) is bounded by its diameter, we can replace the Zmax in the bound 0(/c-|-logn + /niax)) 
with d{S). Now, we just add the stopping time of Phase 1 (the spanning tree time - t{S)) and 
the stopping time of Phase 2 (after Phase 1 has finished), and obtain that the number of rounds 
needed to complete the protocol TAG is 0{k + logn -|- d{S) + t{S)) w.h.p. □ 

4.1 TAG protocol using 1-dissemination as a spanning tree protocol 

The spanning tree task can be successfully performed by a simple gossip broadcast (or 1-dissemination) 
protocol. When a node receives for the first time the message, it marks the sending node as its 
parent. In such a way we obtain a spanning tree rooted at the node that initiated the broadcast 
protocol. Let us denote a gossip 1-dissemination protocol as B. Clearly, the result of Theorem d] 
can be rewritten as: t(TAG) = O {k + log n + d{B) + t{B)). An interesting observation regarding the 
broadcast protocol B, is that for synchronous time model the depth of the broadcast tree cannot 
be larger that the broadcast time (measured in rounds), i.e., t(B) > d(B). The last is true since a 
message can not travel more than one hop in a single round. Thus, for the synchronous time model 
we obtain that the number of rounds needed to complete the TAG protocol w.h.p. is: 

t{TAG) = 0{k + log n + t{B)). (3) 
5 Optimal All-to-all Dissemination Using TAG 

In this section we propose to use the TAG protocol in conjunction with a 1-dissemination (or 
broadcast) gossip protocol B-jm for spanning tree construction. For the case where k = i}{n) 
messages need to be disseminated, TAG with B-jm achieves order optimal performance. For the 
case k = ^}{n) the lower bound of any gossip dissemination protocol is J7(n) rounds. The bound 
from Theorem m gives t(TAG) = 0{k + logn + d{S) + t{S)), and if A; = n we obtain 0{n + t{S)). 
Thus, all we need to show is the existence of a gossip spanning tree protocol that finishes after 
0(n) rounds w.h.p. on any graph. 

Theorem 5. For any connected graph Gn, the stopping time of the broadcast protocol with the 
round-robin communication model - B-jm is 0{n) rounds. In the asynchronous time model, this 
result holds with probability of at least 1 — n(2/e)^", and in the synchronous time model, with 
probability 1. 

In order to prove Theorem [5] we need the following lemma which is proved in the appendix. 

Lemma 2. For any connected graph Gn with n nodes, the sum of the degrees of the nodes along 
any shortest path between any two nodes v and u is at most 3n. 

Proof of Theorem O In this proof we assume the PUSH gossip variation, but it is clear that the 
result holds also for EXCHANGE. Without loss of generality, assume that the message that needs 
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to be disseminated is initially located at the node v. In the round-robin gossip, when a node is 
scheduled to transmit, it transmits a message to its neighbor according to the round robin scheme. 

Consider a shortest path between v and some other node u. On the shortest path of length / 
there is exactly one node at the distance i from v, where i £ [0, . . . , and I < n — 1. Let di be the 
degree of the node at distance i from v. In order to guarantee the delivery of the message from v 
to u, we need Yli=o '^i transmissions in the following order: first, we need do transmissions of the 
node f , then di transmissions of the next node in the path v ^ u, and so on until the message is 
delivered to u. From Theorem [21 Yli=o ^« — 

In the asynchronous model, a node transmits at a given timeslot with probability ^. So, 
the number of timeslots until some specific node transmits is a geometric random variable with 
parameter ^. We define this geometric random variable as X, i.e., X ~ Geom {^)- 

The number of timeslots until 3n specific transmissions occur, is the sum of 3n independent 
geometric random variables. Using a Chernoff bound we obtain O(n^) timeslots (or 0(n) rounds) 
with exponential high probability. The last allows us to perform union bound for shortest paths 
to all other nodes in G, thus obtaining the 0(n) bound for the broadcast time. We omit here the 
formal part of the proof. The full proof can be found in the appendix. 

It is easy to see that in the synchronous time model, 3n specific transmissions will occur exactly 
after 3n communication rounds. E.g., after do rounds, v will perform do transmissions - each one 
to diff^erent neighbor (according to the round-robin scheme). Thus, the message will be delivered 
to u after at most 3n rounds with probability 1. □ 

Using Theorems H] and [5] we obtain the upper bound on the stopping time of TAG with B-jm 
as a spanning tree construction protocol: 0{k + logn + d{S) + n) which is 0(n) for k = ^l{n). 

6 Graphs with a Large Weak Conductance 

For values of k which are smaller than n we use the information spreading protocol (hereafter, IS) 
of [5], which requires only a polylogarithmic number of rounds for broadcast on graphs with large 
weak conductance. Roughly speaking, the weak conductance is a value in [0, 1] that measures the 
connectivity of subsets of nodes of a graph. It has been used to analyze the time required for partial 
information spreading, where each message is only required to reach some fraction of the nodes. 
This, in turn, has been applied in the analysis of the IS protocol to show that the running time for 
full information spreading inversely depends on the weak conductance. The graphs with large weak 
conductance, for which the IS protocol is fast, form a broad family of graphs, including graphs that 
exhibit some (though not too many) communication bottlenecks. A simple example is the barbell 
graph, consisting of two cliques of n/2 nodes, connected by a single edge, which corresponds to 
a bottleneck since information must pass along it, but the probability of randomly choosing it is 
small due to large node degrees. The IS protocol overcomes this and runs in a logarithmic number 
of synchronous rounds on the barbell. 

We describe this result for both the synchronous and asynchronous time models considered. 
Although the IS protocol is designed to disseminate n messages originating one at each node, 
we will only use it for obtaining a spanning tree of our communication graph, while the actual 
information dissemination is done using algebraic gossip (i.e., we use the TAG protocol with IS as 
the spanning tree construction protocol). This is since the IS protocol sends large messages, while 
the goal of algebraic gossip is to address bandwidth concerns. The spanning tree is constructed as 
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follows. The information sent by a node v is an n-bit string, characterizing the nodes from which v 
heard from, whether directly or indirectly. This corresponds to empty initial inputs, and initially 
the n-bit string of node v is a unit vector, characterizing only the empty input of the node v itself. 
The n-bit string maintained and sent by a node v is monotone, in the sense that as time passes, its 
entries can only change from zero to one. The spanning tree that is created corresponds to each 
node V declaring its parent as the first node u from which it received a message that caused its 
most significant bit to change from zero to one. This means that this node received the input of the 
node w corresponding to the most significant bit (recall that the input itself is an empty string). 
The following theorem characterizes the time required for the IS protocol to complete. 

Theorem 6 ( [5, Theorem 4.1]). For every c > 1 and every 5 G (0, l/3c), the IS protocol obtains 

full information spreading after at most 0(c( ^°^ ^"^^Ig)^*^ — ^ + c)) rounds, with probability at least 
1 - 3c6. 

In the synchronous model we can use the IS protocol in the TAG protocol, directly obtaining 
the following theorem, which shows optimality of TAG for certain families of parameters. 

Theorem 7. Let c = 0(log^(n)) for some p > 0, let G be a graph with weak conductance = 
^( logP (n) and let k = ^l{log'^^^^ ('t-))- With probability at least 1 — ^, the time for disseminating 
k messages using protocol TAG in conjunction with the IS protocol is Q{k) synchronous rounds. 

We show that the IS protocol works in the asynchronous model as well. While this is not a 
direct usage of the protocol due to some subtleties, we nevertheless show how to obtain our result 
as for the synchronous model. Our analysis induces an overhead of 0(log^(n)) rounds. 

We do not change the protocol itself to cope with asynchrony, but rather analyze the time 
required using additional techniques. Roughly speaking, the outline of our analysis is showing that 
segments of the asynchronous execution simulate synchronous rounds. This allows us to use the 
original analysis of the protocol for the simulated rounds, which gives our result, as stated in the 
following theorem, and proved in the appendix. 

Theorem 8. Let c = 0(log^ (n)) for some p > 0, let G be a graph with weak conductance $c = 
^( logP (n) )^ and let k = Q{log^^~^^ l'^))- With probability at least 1 — ^, the time for disseminating 
k messages using protocol TAG in conjunction with the IS protocol is 0{k + /max) rounds for the 
asynchronous time model, where Imax is the depth of the spanning tree induced by the IS protocol. 

For completeness, we note that, in IS, during the even- numbered steps of each node the choice 
of neighbor to contact is randomized. For these steps alone, adapting the analysis Mosk-Aoyama 
and Shah [22] for the asynchronous case to our protocol, implies that the extra log (n) time slots 
can be avoided for the purpose of partial information spreading alone (as used in the proof of the 
information spreading protocol (see [U Theorem 2.2]). However, as this cost is required anyhow to 
argue about the deterministic choices, made during the odd-numbered steps, we omit going through 
this adjustment. 
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Appendix 

Table of notations 



n 


Number of nodes 


k 


Number of messages needed to be disseminated 




Connected graph with n nodes 




Connected Tree graph with n nodes 




Network of n queues arranged in a tree topology 


Qline 


Network of Zmax queues arranged in a line topology 


D 


Diameter of a graph 


N{v) 


Set of neighbors of the node v 


dv 


Degree of the node v {dy = \N{v)\) 


A 


Maximum degree of the graph (A = max^ dy) 


timeslot 


Unit of time in the asynchronous time model 


round 


Unit of time in the synchronous time model (1 round = n timeslots) 


S 


Some spanning tree gossip protocol 


B 


Some broadcast (1-dissemination) gossip protocol 


d{S), d{B) 


Diameter of the spanning tree created by the protocol 


nn 


Round-robin communication model 


Bun 


Broadcast gossip algorithm based on the round-robin communication model 


^max 


Depth of the tree created by a broadcast protocol 


TAG 


fc-dissemination protocol that uses algebraic gossip and a spanning tree protocol 


t(TAG), t{S), t{B) 


Stopping time of a protocol 




Stopping time of a queuing system - time by which all customers leave the system 



Table 3: Table of notations 



Proof of Lemma [T] 

Lemma [1] (restated). Let be a tree with n nodes, rooted at the node r, with depth /max- There are 
k initial messages located at some nodes in the tree. Consider algebraic gossip EXCHANGE protocol 
with the following communication model: the communication partner of a node is fixed to be its 
parent in T„ during the whole protocol. Then, the time needed for all the nodes to learn all the k 
messages is 0{k -\- \ogn -\- /max)) rounds for the synchronous and asynchronous time models, with 
probability of at least 1 — ^. 

Proof. The proof is very similar to the proof of Theorem [H and relies on reducing the problem of 
algebraic gossip to a simple system of queues. 

On Tn, consider a message flow towards an arbitrary node v {not necessary the root of T„) 
from all other nodes. Once k helpful messages arrive at v, it will reach the rank k and finish the 
algebraic gossip protocol. Due to the proposed communication model, every node in r„ has a fixed 
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communication partner - its parent, so, each edge e in the tree has at least one node which wih 
issue, on its wakeup, a bidirectional communication (EXCHANGE) over e. Thus, from every node, a 
message can be sent towards v. We ignore messages that are not sent in the direction of v. Ignoring 
part of messages can only increase the stopping time of the algebraic gossip protocol. 

As in the proof of Theorem [H we define a queuing system Q^^"^ by assuming an infinite queue 
with a single server at each node. The root of Qn*^*^ ^i^l be an arbitrary node v, and let 1^^,^ be 
the depth of the tree Q^'^^- 

The service procedure at a node is a transmission of a helpful message towards the node v. 
In our communication model, the communication partner of a node is always its parent in the 
tree. Thus, in the EXCHANGE gossip variation, in the asynchronous time model, in a given timeslot, 
a helpful message will be sent over the edge in a specific direction with probability of at least 
(1 — ^)/'n, where ^ is the probability that a given node wakes up in a given timeslot, and 1 — | is 
the minimal probability that the message will be helpful. Thus, we can consider that the service 
time in our queuing system is geometrically distributed with parameter p > (1 — ^)/ti, and since 

g > 2, we can assume the worst case: P = 

Using Theorem[2]for the tree T„ rooted at v, with fi = p,we get an upper bound for the stopping 
time of the node v, ty = 0{{k + 1^^^^ + log n)2n) timeslots with probability of at least 1 — where 
the ZJ^a^x is the depth of the tree T„ rooted at v. Since /J^^x ^ 2/max (where Zmax is the depth of T„ 
rooted at r), we can replace the ZJ^^x with 2Zinax- 

So, using union bound, we obtain the upper bound (measured in timeslots) for all the nodes in 




= 0((/c + logn + /i^ax)2n) ) > 1--. (4) 

As in the proof of Theorem [H in the synchronous time model, the service time distribution 
parameter p will be larger by a factor of n, and the time will be measured in rounds instead of 
timeslots. Thus, using the same arguments as in the proof of Theorem [H we obtain the upper 
bound of 0{k + logn + /max) rounds for the synchronous time model. Thus, the lemma holds for 
both time models. □ 



Proof of Lemma [2] 

Lemma [2] (restated). For any connected graph Gn with n nodes, the sum of the degrees of the 
nodes along any shortest path between any two nodes v and u is at most 3n. 

Proof. Without loss of generality, consider a BFS spanning tree of G rooted at some node v, and 
some arbitrary leaf u. We will find the maximum degree of the node located on the path [v — )• u) 
at distance i from the root v. Clearly, such a node can be connected only to the following nodes: 

• Nodes that are located at distance i — 1 from the root. (It can not be connected to the nodes 
that are closer to the root (than i — 1) since then, its distance from the root would be i — 1 
which contradicts the given BFS execution.) 

• Nodes that are at the same distance i from the root. 
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• Nodes that are located at distance i + 1 from the root. (It can not be connected to the nodes 
that are farther from the root (than i + 1) since then, their distance from the root would be 
i + 1 which contradicts the given BFS execution.) 

Let us define rui as the number of nodes at distance i from the root. Clearly, Yl^=o ~ C^^^ 
node at distance is the root v). The degree of a node (at distance i from the root) can be at most: 
di < {rrii-i +mj + mj+i). Thus, the sum of degrees on a path of length I from the root to a leaf is at 
most: d = J2i=o Since / < n - 1, d = XlLo '^i - Y17=o = J2i=oi'^i~^ +mi+ mj+i) < 3n. □ 

Proof of Claim [1] 

Claim [1] (restated). For any connected graph Gn with a constant maximum degree (A = 0(1)), 
the diameter of Gn is il(logn). 

Proof. Let us sum up all the n vertices of G„ in the following way. We start with an arbitrary node 
V and count it as 1. Then we split the sum of n vertices into D parts, where D is the diameter 
of Gn- Each part represents number of vertices located at the distance i {i £ [0,..,D]) from the 
node V. Since we are interested in the lower bound on D, we can assume the maximum degree for 
every node (so, the number of parts in the sum will be minimal). We define (i G [0, .., D]) as the 
number of vertices located at the distance i from the node v. Thus we obtain: 

no + ni + n2 -\ \-no = n (5) 

1 + A + + • • • + > n (6) 
A^+2 - 1 

^3^>n (7) 

A^+2 > n (8) 
D + 2> logA n (9) 
D = n{logn) (10) 

□ 



Stochastic Dominance 

Definition 4 (Stochastic dominance, stochastic ordering \15\ I12j). We say that a random variable 
X is stochastically less than or equal to a random variable Y if and only i/Pr(X < t) > Pr(y < t), 
and such a relation is denoted as: X . 

Claim 2. // for i G {1,2}, Xi ^ Yi, Xi are independent, and Yi are independent, then: maxjXj ■< 
maxj Yi . 

Proof. 

Pr(maxXi < t) = [^Pr(Xi <t)= JJPr(Xi < t) 

i i 

> PJPr(yi <t)= Pr(maxyi < t). 
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Hence: 



□ 



Claim 3. If for i G {1,2}, Xi :< Yi, Xi are independent, and Yi are independent, then: "Yli^i ^ 



Proof. 



Thus: 



Hence: 



Pr(Xi + X2<t)= [ {s)ds, 
J —00 

where fxi+Xii^) = fxi{s) * fxii^)- 

/t poo 
/ fxAr)fxAs-r)dTds (11) 
-00 J —00 

/oo 
fxArWX2<t-T)dT (12) 
-00 

/oo 
/xi(r)Pr(y2 <t-T)dr (13) 
-00 

/t poo 
/ fxAr)fY,{s-T)dTds (14) 
-00 J —00 

/t POO 
/ fYAr)fxAs-r)dTds (15) 
-00 J —00 

/oo 
/y, (r) Pr(Xi <t-r)dr (16) 
-00 

/oo 
/y,(r)Pr(yi<i-r)dT (17) 
-00 

/t poo 
/ fY,{r)fYAs-r)dTds (18) 
-00 J —00 

= Pr(Yi + ^2 < *)• (19) 



1=1 i=i 



□ 



Later cirrivals yield later depeirtures 

Consider an infinite FCFS queue with a single exponential server. We define Oj as the time of 
arrival number i to the queue, and di as time of the departure number i from the queue. Let X^ 
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be the exponential random variable representing the service time of the arrival i. For all i, Xi's are 
i.i.d. 

Let be a sequence of m arrival times to the queue, and di be a sequence of m departure times 
from the queue. 




ai 02 as 04 as 



















X2 




X3 




Xi 


X5 

































t 



di d2 ds ds 



di = max(ai, di-i) + Xi 
Figure 2: Arrival and departure times. 

Lemma 3. If the sequence is replaced with another sequence ofm arrivals - di, such that: di ^ aj 
Vi G [1, ...,m], then, the resulting sequence ofm departures will be such that: di ^ di\/i £ [1, ...,m]. 
I.e., if every new arrival occurred , stochastically, at the same time or later than the old arrival, 
then, every new departure from the queue will occur , stochastically, at the same time or later than 
the old departure. 

Proof. The proof is by induction on the arrival index j, j G [1, ...,m]. 

• Induction basis: di ^ di. Follows since di = ai + Xi, di = di + Xi, and ai ^ ai. 

• Liduction assumption: \/i < j : di ^ di. 

• Induction step: we need to show that dj ^ dj. 

If the j's arrival occurred when the server was busy, then dj = dj-i + Xj. If the server was idle 
when the j's arrival occurred, then dj = aj + Xj. Thus, we can write: 

dj = max((ij_i, Oj) + Xj, (20) 
and dj = max((ij_i, cij) + Xj. (21) 

Since from induction assumption: dj-i ^ dj-i, and dj ^ Oj, using Claims [2] and El we obtain 
dj ^ dj. □ 

Proof of Theorem [2] 

Theorem [2] (restated). Let Q^^'^^ he a network of n nodes arranged in a tree topology, rooted at 
the node v. The depth of the tree is ?max- Each node has an infinite queue, and a single exponential 
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server with parameter fi. The total amount of customers in the system is k and they are initially 
distributed arbitrarily in the network. The time by which all the customers leave the network via 
the root node v is t{Q^^^^) = 0{{k + /max + logn)///) timeslots with probability of at least 1 — 

Proof. We denote the nodes of the queuing system Q^^'^^ as Zj, where / (/ G [1, /max]) is the level 
of the node in the tree, and j is the node's index in the level /. The root of the Q^^*^ tree is the 
node Zl. Ah servers in the Q^^^^ network are ON all the time (work-conserving scheduling), i.e., 
servers work whenever they have customers to serve. There are no external arrivals to the system. 
Once a customer is serviced on the level /, it enters the appropriate queue at the level 1 — 1. When 
a customer is serviced by the root Z\, it leaves the network. 

Now, let us define the auxiliary queuing systems: Q^n^^ and Q;™*^- 

Definition 5 (Network Q^^^^). Q^n^^ is the same network as Q^'^^ with the following change in the 
servers' scheduling: 

At any given moment, only one server at every level I (I G [1, Imax]) is ON. Once a customer 
leaves level I, a server that will be scheduled (turned ON) at the level I, is the server which has in 
its queue a customer that has earliest arrival time to a queue at the level I among all the current 
customers at the level I. If there are customers that initially reside at the level I, they will be serviced 
by the order of their IDs (we assume for analysis that every customer has a unique identification 
number). 

Definition 6 (Network of queues Qf"^). Q^*""^ is the the following modification of the network 
Qn^^, that results in a network o//max queues arranged in a line topology. 

For all I G [1, ■.,/max]; we merge all the nodes at the level I to a single node (a single queue with 
a single server). We name this single node at the level I as the first node in Q^^^ o,t the level I, 
i.e., Z\. The customers that initially reside at level I will be placed in a single queue in the order of 
their IDs. This modification results in Q^i^^^ ~ network of /max queues arranged in a line topology: 

Definition 7 (Network of queues Ql^^^J)- Qi(^^^ ~ is the same system as Qf2_^^ with the following 
modification. We take the last customer at some node Z"^ (m G [1, ..,/max — 1]) and place it at the 
head of the queue of the node Z^~^^ . I.e., we move one customer, one queue backward in the line 
of queues. 

Definition 8 (Network of queues Qf^^J. Q^f^L ~ 

same system as Qf^^^ with the following 
modification. We move all the customers to the queue Zj'"'"'. I.e., all the customers have to traverse 
now through all the /max queues in the line. 

We summarize the queuing systems defined above in the short Table HI 

The proof of Theorem [2] consists of showing the following relations between the stopping times 
of the queuing systems: 

t{Qtn < t{Qtn ^ t{Qt:j < tiQ^CJ ^ = 0{{k + logn + /max)/M). 

Stopping time of a queuing system t{Q), is the time by which the last customer leaves the 
system (via the node Z\). In order to compare the stopping times of queuing systems, we define 
the following ordered set (or sequence) of departure time from a server Z in a queuing system Q: 
d{Z,Q) = {di{Z,Q),d2{Z,Q), ...,di{Z,Q), ...), where di{Z,Q) is the time of the departure number 
i from the node (server) Z. 
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Qtree 


Original system of n queues arranged in a tree topology. Fig. |3] (a) . 




System of n queues arranged in a tree topology. Only one server is 
active at each level at a given time. Fig. [3] (b). 


Qline 


System of l^aiL queues arranged in a line topology. Fig. [3] (c) . 


Qline 


System of /max queues arranged in a line topology. One customer is 
moved one queue backward. 


Qline 


System of /max queues arranged in a line topology. All customers are 
moved backward to the queue Zj™"". 



Table 4: Queuing systems used in the proof. 



First, we want to show that the stopping time of Qn^*^ is at most the stopping time of the 
system Qt'\ i.e., t{Qtn < tiQu^- 

Lemma 4. In Qn'^^, every departure from the system (via Z\) will occur, stochastically, at the 
same time or later than in Q^^'^' 

di{ZlQtn ^ d,{ZlQtn V^ G [l,...,fc]. (22) 

Thus, in Q^ '^^, the last customer will leave the system, stochastically, at the same time or later 
than in Q^^^^ , or: t{Q^^'^^) ^ 

Proof. The proof is by induction on the tree level /, / G [1, ...,/max]. 

• Induction basis: Vi,j : Q^'^^^) h Q^''^^). This is true since in Q^'■^^ the 
nodes do not work all the time, and thus the departures will occur, stochastically, at the 
same time or later than in Q^^^- If there is a single node at the level Zmaxj in Qn"^^ it '^ili be 
ON all the time as in Qn^'^, and thus, the departures will occur, stochastically, at the same 
time in both systems. 

• Induction assumption: for all / > m (m > 1), ViJ : di{Zj,Q^^'"') >z di{Z^j , Qn""^) . 

• Induction step: we need to show that: Vi,j : di{Zf ^Q^^""") >z di{Zf ,Q*^''''). 

By induction assumption, for / = m + 1: Vi, j : di{Zf+^, Q^''^^) ^ Q^^'^^). Now let us take 

a look at the departures from a node ZJ^. There are two cases: ZJ^ is a leaf, and ZJ^ is not a leaf. 
If ZJ^ is a leaf, we can use the same argument as in the induction basis: in Q^*^^, the node ZJ^ does 

not work all the time, and thus the departures from it in Q^^^^ cannot occur earlier than in Q^^^- 
If Zj" is not a leaf, it has input /inputs of arrivals from the level m + 1. Since the arrivals from 
the level m + 1 in Q^'^^ occur, stochastically, at the same time or later than in Q^*^^ (by induction 
assumption), even if the node Zj" would work all the time (as in Q^^^), we would obtain from 
Lemma E) ViJ : di{Zf,Q^^'''') >z di{ZJ',Q^;-''''). Moreover, in g^'•^^ the node ZJ" does not work 

all the time (unless it is the only node at the level m), thus the departure times in Q^^^ can be 
even larger. □ 

Lemma 5. In Qf^^^, every departure from the system (via Z\) will occur, stochastically, at the 
same time as in Q^'^'^- Thus, in Q^i^^^; ihe last customer will leave the system, stochastically, at 
the same time as in Q^^^- 
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(a) - Network Q^' 



(b) - Network Q*„' 



(c) - Network Qf 



Figure 3: (a) - Network Q^*^*^, where all the servers work all the time, (b) - Network Q^'^'^, where 
only one server at each level works at a given time, (c) - Network Q^"*^ . 

^ ^ tmax 

Proof. Consider the two following facts regarding the network Q^^*^. First, a customer entering 
the level I will be serviced after all the customers that arrived to the level / before it, are serviced. 
Second, at any given moment, only one customer is being serviced at the level I (if there is at least 
one customer at the nodes Zj). These facts are true due to the scheduling of the servers in Qn*^^ 
(Definition [5]). 

Clearly, the same facts are true for the network First, any customer entering to the level 

I will be serviced after all the customers that arrived to the level / before it, are serviced. Second, 
at any given moment, only one customer is being serviced at the level I (if there is at least one 
customer in the node Z[). These facts are true since in Q;™^, at every level, there is a single queue 
with a single server (Definition [6]) . 

So, the departure times of every customer from every level / (Z G [1, ^max]) are, stochastically, 
the same in both systems. The departures from level I = 1 are the departures from the node Z|, 
and thus the lemma holds. □ 



Now we are going to move one customer, one queue backward and will show that the resulting 
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(a) - Network Q^i^l^ 



all the k customers 



(b) - Network QJ™ 



(c) - Network Q[;^ 



^1 
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Figure 4: (a) - Network Q\^^^^- (b) - Network Qn^^' where one customer is moved one queue 
backward, (c) - Network Qi*"-^ , where all the customers are at the last queue. 



system will have stochastically larger (or the same) stopping time. 

Lemma 6. Consider a network Q'*"*^- Let m be a level index: m G [1, ..,Zmax — !]■ 
last customer at the node ZJ" and place it at the head of the queue of the node Z^^^ , 
resulting network - Q\^^^^ (Fig- (b))- Then: 



We take the 
and call the 



(23) 



Thus, in Q^^ , the last customer will leave the system, stochastically, at the same time or later 
than in Q'*""7or.- t(Q|*"^ ) < tiQ^""^ ). 

Proof. We call the customer that was moved - customer c. Let us take a look at the times of 
arrivals to the node ZV^ in Qi*"*^ and in Q^"*^ . Since the customer c is already located in the 

^ 'rnax '■max 

queue of ZV^ in Q?*"^ , its arrival time can be considered as 0. In Q¥"-^ , the arrival time of c is at 

J- ^ tmax tmax 

least (it should be serviced at Z^^^ before arriving at Z™). Each one of the rest customers that 
should arrive at will arrive in Qi^^^, stochastically, at the same time or later than in Q;^^^, 



since in Qi*"^ the server Zl""*"^ should first service the customer c, and only then will start servicing 
the rest customers. Thus, di(Z™+\ Q'^'^n >- di(Z'^^\ Q^'^^n . Using Lemma [3] we obtain that: 

!,...,!], we 
□ 



di{Z^,Q\^^J ^ di{Z^,Q\^^J. Iteratively applying Lemma [3] to the nodes Z^ I G [. 
obtain the result: dAZlQ]^''^ ) ^ di(Z|, Q'*"^ ). 



Corollary 1. Consider a network Qi^^^ {Definition\W which is identical to the network Q¥"'^ with 

tmax * *^ tmax 

the following change. In 0'*""^, clU the k customers are located at the node zj™'''' (Fig. [7] (c))- 
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Then: 



d,{Zl QtTx) ^ diiZlQtL) e [1, k]. (24) 



Thus, in Qj*"^ , the last customer will leave the system, stochastically, at the same time or later 
than in Q'*"'^ , or: tiOf'''' ) < tiOf'^''). 

'rnax ^ tmax ' ^ ^max ' 



Proof. Given the network Q;*"*^ we take one customer from the tail of some queue (except the 
queue of the node Zl™"") and place it at the head of the queue of the preceding node in the Q^^l^- 
According to the Lemma [6l we get a network in which every customer leaves via Z\, stochastically, 
not earlier than in Q/™*^- Iteratively moving customers (one customer and one queue at a time) 
backwards we get finally the network Q,*"'^ in which all the k customers are located at the node 

tmax 

Since at each step, according to Lemma [6l the departure times from Z\ could only get, 
stochastically, larger, the lemma holds. □ 

Corollary 2. The time it will take the last customer to leave the network of n queues arranged in 
a tree topology is, stochastically, the same or smaller than in the network of n queues arranged in 
a line topology where all the k customers are located at the farthest queue, i.e., t{Q^^^'^) < ^(0|^J^)- 

Proof. This corollary is a direct consequence of the Lemmas IH [5l and the Corollary [T] □ 

Now we are ready for the last step of the proof. We will find the stopping time of a system of 
queues arranged in a line topology and with all the customers located at the last queue. 

Lemma 7. The time it will take to the last customer to leave the system Qf^^^ (Imsix MMl queues 
arranged in a line topology) is 0{{k + logn + /max)/^) with probability of at least 1 — 

Proof. Initially, all the customers (from now we will call them real customers) are located in the 
last [Z^^'^'^) queue. We now take all the real customers out of this queue and will make them 
enter the system (via the Z^^'^^) from outside. We define the real customers' arrivals as a Poisson 
process with rate A = ^. So, p = ^ = ^ < 1 for all the queues in the system. Clearly, such an 
assumption only increases the stopping time of the system (stopping time is the time until the last 
customer leaves the system). According to Jackson's theorem, which proof can be found in [7j, 
there exists an equilibrium state. So, we need to ensure that the lengths of all queues at time i = 
are according to the equilibrium state probability distribution. We add dummy customers to all 
the queues according to the stationary distribution. By adding additional dummy customers to the 
system, we make the real customers wait longer in the queues, thus increasing the stopping time. 

We will compute the stopping time ^(Q|™'^) in two phases: Let us denote this time as ti + 12, 
where ti is the time needed for the A;'th customer to arrive at the first queue, and t2 is the time 
needed for the A;'th customer to pass through all the /max queues in the system. 

Prom Jackson's Theorem, it follows that the number of customers in each queue is independent, 
which implies that the random variables that represent the waiting times in each queue are inde- 
pendent. To continue with the proof we need the following lemmas; the first is a classical result 
from queuing theory, the proof of the second lemma is omitted. 

Lemma 8 (|24j. section 4.3). Time needed to cross one M/M/1 queue in the equilibrium state has 
an exponential distribution with parameter /x — A. 
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Lemma 9. Let Y he the sum of n independent and identically distributed exponential random 
variables. Then, for a > 1: 

Pr {Y < aE[Y]) > 1 - (2e-"/2)". (25) 

The random variable ti is the sum of k independent random variables distributed exponentially 
with parameter ij,/2. Prom Lemma [8] we obtain that t2 is the sum of ^max independent random 
variables distributed exponentially with parameter /x — A = fi/2. E [ti] = Yli=i ^//^ = 2fc//U, and 
by taking a = 2 + 4^2|p, we obtain: 

(26) 

Pr {ti < (4A; + 8 In n)/n) > 1 - (2e"(2+^^)/2)^ (27) 

= 1_ (D'^e-^i^" (28) 
>^_g-21nn ^29) 

>^-^- (30) 

In a similar way we obtain: 

FT{t2<{4lmax+8lnn)/fl)>l-^. (31) 

^/•Qhne ^ = ^ _|_ ^ thus, usine union bound: 

^ 'max ^ / ^ w 

Pr (ti + t2< (4A: + Al^ax + 16 In n)/fi)>l-^ (32) 
and thus: 

t(QtD = 0{{k + /max + log n)//i) (33) 
w.p. of at least 1 
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□ 



From Claim [2] we obtain that t(Qr^) ^ ^(Ql^Jl) and thus: t(Qr^) = 0((/c + W + logn)/^) 
w.p. of at least 1 — □ 

Proof of Theorem [5] 

Theorem O (restated). For any connected graph Gn, the stopping time of the broadcast protocol 
with the round-robin communication model - B-jm is 0{n) rounds. In the asynchronous time model, 
this result holds with probability of at least 1 — n(2/e)^"', and in the synchronous time model, with 
probability 1. 

For the proof we need the following lemma: 

Lemma 10. Let X be a sum of m independent and identically distributed geometric random vari- 
ables (each one with parameter p > 0) and E[X] = ^. Then, for a > l.- 
Fi {X < aE[X]) > I - {ae^-")"" . (34) 
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Proof. First, we will define Y as the sum of k independent Bernoulli random variables, i.e., Y = 
Yli=i ^5 where Yi ~ Bernoulli{p). Let us notice that: 

Pr {X <k)= Pr {Y > m) (35) 

The last is true since the event of observing at least m successes in a sequence of k Bernoulli trials 
implies that the sum of m independent geometric random variables is no more than k. On the 
other hand, if the sum of m independent geometric random variables is no more than k it implies 
that m successes occurred no later than the k-th trial and thus Y > m. 

Now we will use a Chernoff bound for the sum of independent Bernoulli random variables 
presented in [21] : For any < 6 < 1 and fi = E[Y]: 

P^(^<(l-^)/^)<( (l (36) 
Since /i = E [y] = kp, and by letting 6 = -^^^^ we obtain: 

/ TTl \ 

Pr(r<(l-^)/x)=Pr(y<m)< . (37) 

\ e "I kp J 

( \ 

Pr(y>m)>l- (38) 
\ e kp ) 

Vr{X<k)>\-[^^ (39) 
\ e m kp J 

By substituting k = ay = aE [X] (where a > 1) we obtain: 

a \ —m 



Pr {X <aE[X])>l - y—j (40) 

□ 

Proof of Theorem [3 In this proof we assume the PUSH gossip variation, but it is clear that the 
result holds also for EXCHANGE. 

Without loss of generality, assume that the message that needs to be disseminated is initially 
located at the node v. In the round-robin gossip, when a node is scheduled to transmit, it transmits 
a message to its neighbor according to the round robin scheme. I.e, at every transmission a message 
is sent to a different neighbor. 

Consider a shortest path between v and some other node u. On the shortest path of length / 
there is exactly one node at the distance i from v, where i G [0, . . . , Z], and I < n — 1. Let di be the 
degree of a node at the distance i from v. In order to guarantee the delivery of the message from 
V to u, we need Yl\=o ^« transmissions in the following order: first, we need at do transmissions of 
the node v, then di transmissions of the next node in the path v ^ u, and so on until the message 
is delivered to u. From Theorem [21 X]!=o — 

In the asynchronous model, a node transmits at a given timeslot with probability ^. So, 
the number of timeslots until some specific node transmits is a geometric random variable with 
parameter ^. We define this geometric random variable as X, i.e., X ~ Geom (i). 
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The number of timeslots until 3n specific transmissions occur, is the sum of 3n independent 
geometric random variables. Using Lemma [10] we obtain the bound of O(n^) timeslots (or 0(n) 
rounds) with exponential high probability. The last allows us to perform union bound for shortest 
paths to all other nodes in G, thus obtaining the 0{n) bound for the broadcast time. 

Now, let us write the above more formally: 

We define t^^u as the time it takes to guarantee the delivery of the message from the node 
V to an arbitrary node u. As we showed above, t^^u is the number of timeslots until 3n specific 
transmissions occur, so: 

3n 



U^^ = J2Xi^ (41) 



i=l 



Xi\/i £ [1, . . . , 3n] are i.i.d. and distributed as X, 



Thus, 

From Lemma [TOl with a = 2 : 



3n 



.i=l 



where X ~ Geom ( — 
n 



3n 3n 



^E[Xi]=^n = 3n^. (42) 



1=1 1=1 



Pr (f,^„ < 2E [t,^ J ) > 1 - (2/e)3" , (43) 



or 



Pr(t,^„<6n2) >l-(2/e)3«. (44) 

Now, we will apply a union bound on probabilities of the events: t^^u' > 6n^, where u' S V. 
Notice, that E [t^^u] = Sn^ for ah u' € V. 



so, 



Pr IJ it,^u'>Qn^) < Pr(t 
\u'ev J u'ev 

y {t,^u' > 6n2) J < n(2/e) 



(45) 



Pr ( t J (t^^u' > 6n2) ) < n(2/e)3". (46) 

Thus, 



Pr ( f] {t^^u' < 6n2) ) > 1 - n{2/e) 
\u'ev J 



So, we obtain the result of O(n^) timeslots, or 0{n) rounds. 

Easy to see that in the synchronous time model, 3n specific transmissions will occur exactly 
after 3n communication rounds. E.g., after do rounds, v will perform do transmissions - each one 
to different neighbor (according to the round-robin scheme). Thus, the message will be delivered 
to u after at most 3n rounds with probability 1. □ 
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Proof of Theorem [8] 



Theorem [8] (restated). Let c = 0(log^ (n)) for some p > 0, let G be a graph with weak conduc- 
tance = ^( logp (n) ^^'^ ^ ~ 0(log^^'''^ (^))- With probability at least 1 — the time for 
disseminating k messages using protocol TA.G in conjunction with the IS protocol is 0{k -t- /max) 
rounds for the asynchronous time model, where /max is the depth of the spanning tree induced by 
the IS protocol. 

Proof. To simulate one round of a synchronous protocol, we consider the execution of the protocol 
for 0(n log (n)) time slots, which is log (n) asynchronous rounds. With high probability, \ — 5 for 
some small (5, each node takes at least one step. This follows from a standard coupon collector's 
argument, as steps of each node correspond to a different coupon. The crucial property of the 
information spreading protocol that allows our analysis to go through is its monotone nature, that 
is, the information collected and sent by a node is an n-bit string whose entries can only turn 
form zero to one as time passes. This implies that whenever each node took at least one step, 
the strings obtained can only contain more one entries than the strings obtained by one round of 
the synchronous model (recall that the goal is for all nodes to obtain a string of ones). Hence, 
after 0(T log (n)) asynchronous rounds, the information the nodes have is at least the information 
that they have after T rounds in the synchronous model. This does not yet conclude the proof, 
for the following reason. The analysis of the synchronous protocol goes through in this simulation 
except for one argument [H Claim 1], which bounds the size of the deterministic list of subset of 
neighbors that is maintained by a node v. This size is bounded by the number of steps taken by 
V. On one hand, we need the number of steps taken by v in each 0(n log (n)) time slots to be 
at least one to argue the simulation, but on the other hand, it may be that a node takes a larger 
number of steps. This would imply that its list is larger than in the corresponding synchronous 
case. However, returning to the coupon collector's problem, we have that actually no node takes 
more than 0(log (n)) steps in each 0(n log (n)) time slots (within the same high probability). This 
implies that the size of the list a node maintains is at most a multiplicative factor of 0(log (n)) 
larger than its size after T synchronous rounds. Since this size is used, in turn, to bound the number 
of synchronous rounds required, we have to add an additional 0(log(n)) factor to the number of 
rounds in the asynchronous model. 

Finally, we note that the probability of failure of the coupon collector's argument (in either the 
lower or upper bound on the number of steps per node) needs to be added up for all simulated 
rounds. When this number of rounds T is polylogarithmic in n, we have that using a union bound 
we remain with a high probability for the entire argument. □ 



28 



